1 Optimal and approximate computation of summary statistics for range aggregates

Anna C. Gilbert, Yannis Kotidis, S. Muthukrishnan, Martin J. Strauss

May 2001 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on 
Principles of database systems 

Additional Information: full citation, abstract, references, citings, index terms

Full text available: pdf



Fast estimates for aggregate queries are useful in database query optimization, 
approximate query answering and online query processing. Hence, there has been a lot of 
focus on "selectivity estimation", that is, computing summary statistics on the underlying 
data and using that to answer aggregate queries fast and to a reasonable approximation. 
We present two sets of results for range aggregate queries, which are amongst the most 
common queries. 

First, we focus on a histog ... 

2 Maintenance of data cubes and summary tables in a warehouse

Inderpal Singh Mumick, Dallan Quass, Barinderpal Singh Mumick

June 1997 ACM SIGMOD Record , Proceedings of the 1997 ACM SIGMOD international 
conference on Management of data, volume 26 issue 2 

Additional Information: full citation, abstract, references, citings, index terms

Full text available: pdf



Data warehouses contain large amounts of information, often collected from a variety of 
independent sources. Decision-support functions in a warehouse, such as on-line analytical 
processing (OLAP), involve hundreds of complex aggregate queries over large volumes of 
data. It is not feasible to compute these queries by scanning the data sets each time. 
Warehouse applications therefore build a large number of summary tables, or materialized 
aggregate views, to ... 

3 Query processing techniques in the summary-table-by-example database query language

Gultekin Ozsoyoglu, Victor Matos, Meral Ozsoyoglu 

December 1989 ACM Transactions on Database Systems (TODS), volume 14 issue 4 

Additional Information: full citation, abstract, references, citings, index terms, review

Full text available: pdf(3.52 MB)



Summary-Table-by-Example (STBE) is a graphical language suitable for statistical database 
applications. STBE queries have a hierarchical subquery structure and manipulate summary
tables and relations with set-valued attributes. The hierarchical arrangement of STBE 
queries naturally implies a tuple-by-tuple subquery evaluation strategy (similar to the 
nested loops join implementation technique) which may not be the best query processing 
strategy. In this paper we discuss the query ...

4 XML and architecture: DSQoS-distributed architecture providing QoS in summary 
warehouses 

Joao Pedro Costa, Pedro Furtado 

November 2003 Proceedings of the 6th ACM international workshop on Data 
warehousing and OLAP 

Full text available: pdf(305.88 KB) Additional Information: full citation, abstract, references, index terms

Data warehouses (DW) that store enormous quantities of data put a major challenge in 
what concerns performance and scalability, as users request instant answers to their 
queries. Traditional solutions rely on very expensive architectures and structures for 
speedup and scale-up. The Summary Warehouse (SW) is an inexpensive solution that has 
the potential to deliver very fast approximate answers to aggregate queries using only 
general-purpose sampling summaries. Although summaries are expected to b ... 

Keywords: OLAP, approximate query answering, data warehouse, sampling 



5 Summary data: Modelling summary data
Rowland R. Johnson 

April 1981 Proceedings of the 1981 ACM SIGMOD international conference on 
Management of data 

Full text available: pdf Additional Information: full citation, abstract, citings

Several problems in specifying aggregate functions in relational systems are investigated. 
We propose a solution to these problems in the form of an extension of the relational data 
model. In particular we introduce the concept of summary data. The query language 
STRAND is presented in order to describe retrieval operations on the extended model. 
STRAND allows a user to formulate queries involving aggregate functions without 
conceptualizing the query in terms of aggregation. Two example applicat ... 

6 On optimizing summary-table-by-example queries
Gultekin Ozsoyoglu, Victor Matos 

March 1985 Proceedings of the fourth ACM SIGACT-SIGMOD symposium on Principles 
of database systems 

Full text available: pdf Additional Information: full citation, references, citings



7 Research sessions: spatial data: Spatially-decaying aggregation over a network: model
and algorithms

Edith Cohen, Haim Kaplan 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Full text available: pdf(358.49 KB) Additional Information: full citation, abstract, references



Data items are often associated with a location in which they are present or collected, and 
their relevance or influence decays with their distance. Aggregate values over such data 
thus depend on the observing location, where the weight given to each item depends on its 
distance from that location. We term such aggregation spatially-decaying. Spatially-decaying 
aggregation has numerous applications: Individual sensor nodes collect readings of an 
environmental parameter such as contaminatio ... 

8 The aggregate data problem: a system for their definition and management

M. Rafanelli, A. Bezenchek, L. Tininini 

December 1996 ACM SIGMOD Record, Volume 25 issue 4 

Full text available: pdf(640.68 KB) Additional Information: full citation, abstract, citings, index terms

In this paper we describe the fundamental components of a database management system
for the definition, storage, manipulation and query of aggregate data, i.e. data which are 
obtained by applying statistical aggregations and statistical analysis functions over raw 
data. In particular, the attention has been focused on: (1) a data structure for the efficient 
storage and manipulation of aggregate data, called ADaS; (2) the graphical structures of 
the aggregate data model ADAMO for a more use ... 

9 Semantic-based delivery of OLAP summary tables in wireless environments

Mohamed A. Sharaf, Panos K. Chrysanthis 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: pdf(251.10 KB) Additional Information: full citation, abstract, references, citings, index terms

With the rapid growth in mobile and wireless technologies and the availability, 
pervasiveness and cost effectiveness of wireless networks, mobile computers are quickly 
becoming the normal front-end devices for accessing enterprise data. In this paper, we are 
addressing the issue of efficient delivery of business decision support data in the form of 
summary tables to mobile clients equipped with OLAP front-end tools. Towards this, we 
propose a new on-demand scheduling algorithm, called SBS ... 

Keywords: broadcast pull, broadcast scheduling, mobile computing 



10 A language and a physical organization technique for summary tables
Gultekin Ozsoyoglu, Z. Meral Ozsoyoglu, Francisco Mata 

May 1985 ACM SIGMOD Record, Proceedings of the 1985 ACM SIGMOD international
conference on Management of data, volume 14 issue 4

Full text available: pdf(1.20 MB) Additional Information: full citation, references, citings, index terms



11 Performance evaluation of the statistical aggregation by categorization in the SM3
system

C. K. Baru, S. Y. W. Su

June 1984 ACM SIGMOD Record, Proceedings of the 1984 ACM SIGMOD international
conference on Management of data, Volume 14 issue 2

Full text available: pdf(132 MB) Additional Information: full citation, abstract, references

To perform a statistical aggregation operation over a large file often requires that the 
records of the file be divided into categories based on the values of the attribute(s) over 
which some statistical computation is to be performed. It is rather inefficient to perform the 
necessary data transfer, categorization and statistical computation using a single processor.
Parallel algorithms designed for multiprocessor systems have been proposed and their
performance improvement over the conventional ... 

12 Extending relational algebra and relational calculus with set-valued attributes and
aggregate functions

G. Ozsoyoglu, Z. M. Ozsoyoglu, V. Matos 

November 1987 ACM Transactions on Database Systems (TODS), volume 12 issue 4 

Full text available: pdf(1.80 MB) Additional Information: full citation, abstract, references, citings, index terms

In commercial network database management systems, set-valued fields and aggregate 
functions are commonly supported. However, the relational database model, as defined by 
Codd, does not include set-valued attributes or aggregate functions. Recently, Klug 
extended the relational model by incorporating aggregate functions and by defining 
relational algebra and calculus languages. In this paper, relational algebra and relational 
calculus database query languages (as defined by Klug) ... 




Results (page 1): de-sensitize aggregated summary 






13 Incremental update to aggregated information for data warehouses over internet
Miranda Chan, Hong Va Leong, Antonio Si 

November 2000 Proceedings of the 3rd ACM international workshop on Data 
warehousing and OLAP 

Full text available: pdf(248.52 KB) Additional Information: full citation, references, citings, index terms, review



Keywords: Internet, aggregated information, data warehouse, distributed databases, 
incremental refresh and propagate 



14 Online analytic processing (OLAP): QC-trees: an efficient summary structure for
semantic OLAP 

Laks V. S. Lakshmanan, Jian Pei, Yan Zhao 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available: pdf(375.81 KB) Additional Information: full citation, abstract, references, citings, index terms

Recently, a technique called quotient cube was proposed as a summary structure for a data 
cube that preserves its semantics, with applications for online exploration and visualization. 
The authors showed that a quotient cube can be constructed very efficiently and it leads to 
a significant reduction in the cube size. While it is an interesting proposal, that paper leaves 
many issues unaddressed. Firstly, a direct representation of a quotient cube is not as 
compact as possible and thus still wast ... 

15 Using aggregation and dynamic queries for exploring large data sets

Jade Goldstein, Steven F. Roth 

April 1994 Proceedings of the SIGCHI conference on Human factors in computing 
systems: celebrating interdependence 

Full text available: pdf Additional Information: full citation, references, citings, index terms



Keywords: data exploration, data visualization, graphics presentation, intelligent 
interfaces, interactive techniques, large data sets 



16 Supporting aggregation in fine grain software configuration management

Mark C. Chu-Carroll, James Wright, David Shields

November 2002 Proceedings of the tenth ACM SIGSOFT symposium on Foundations of 
software engineering 

Full text available: pdf(207.31 KB) Additional Information: full citation, abstract, references, citings, index terms

Fine-grained software configuration management offers substantial benefits for large-scale
collaborative software development, enabling a variety of interesting and useful features 
including complexity management, support for aspect-oriented software development, and 
support for communication and coordination within software engineering teams, as 
described in [4]. However, fine granularity by itself is not sufficient to achieve these 
benefits. Most of the benefits of fine granularity result from ... 

Keywords: aggregation, dynamic program organization, fine grained storage 

17 Supporting aggregation in fine grain software configuration management

Mark C. Chu-Carroll, James Wright, David Shields 

November 2002 ACM SIGSOFT Software Engineering Notes, volume 27 issue 6 

Full text available: pdf(1.05 MB) Additional Information: full citation, abstract, references, index terms

Fine-grained software configuration management offers substantial benefits for large-scale 
collaborative software development, enabling a variety of interesting and useful features 
including complexity management, support for aspect-oriented software development, and 
support for communication and coordination within software engineering teams, described 
in [4]. However, fine granularity by itself is not sufficient to achieve these benefits. Most of 
the benefits of fine granularity result from th ... 

Keywords: aggregation, dynamic program organization, fine grained storage 



18 A universal-scheme approach to statistical databases containing homogeneous
summary tables

Francesco M. Malvestuto 

December 1993 ACM Transactions on Database Systems (TODS), volume 18 issue 4

Full text available: pdf(2.00 MB) Additional Information: full citation, references, citings, index terms, review



Keywords: bipartite graph, category relation, query-answering system, statistical 
database, summary table, universal classification scheme 



19 Research sessions: indexing and tuning: Transaction support for indexed summary
views 

Goetz Graefe, Michael Zwilling 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Full text available: pdf(158.70 KB) Additional Information: full citation, abstract, references

Materialized views have become a standard technique for performance improvement in 
decision support databases and for a variety of monitoring purposes. In order to avoid 
inconsistencies and thus unpredictable query results, materialized views and their indexes 
should be maintained immediately within user transaction just like indexes on ordinary 
tables. Unfortunately, the smaller a materialized view is, the higher the concurrency 
contention between queries and updates as well as among concurrent ... 

20 The derivation problem for summary data

F. M. Malvestuto

June 1988 ACM SIGMOD Record , Proceedings of the 1988 ACM SIGMOD international 
conference on Management of data, volume 17 issue 3 

Full text available: pdf(899.44 KB) Additional Information: full citation, abstract, references, citings, index terms

Given a statistical database consisting of two summary tables based on a common but not 
identical classification criterion (e.g., two geographical partitionings of a country) there are 
additional summary tables that are derivable in the sense that they are uniquely (i.e., with 
no uncertainty) determined by the tables given. Derivable tables encompass not only, of 
course, "less detailed" tables (that is, aggregated data) but also "more detailed" table ... 